POLYCOST: A telephone-speech database for speaker recognition
نویسندگان
چکیده
This article presents an overview of the POLYCOST database dedicated to speaker recognition applications over the telephone network. The main characteristics of this database are: large mixed speech corpus size (> 100 speakers), English spoken by foreigners, mainly digits with some free speech, collected through international telephone lines, and more than eight sessions per speaker.
منابع مشابه
Speaker verification on the polycost database using frequency filtered spectral energies
The spectral parameters that result from filtering the frequency sequence of log mel-scaled filter-bank energies with a first or second order FIR filter have proved to be competitive for speech recognition. Recently, the authors have shown that this frequency filtering can approximately equalize the cepstrum variance enhancing the oscillations of the spectral envelope curve that are most effect...
متن کاملA comparative study of speaker verification systems using the polycost database
This paper reports on a comparative study of several automatic speaker verification systems using the Polycost database. Polycost is a multi-lingual database with non-native English and mother-tongue speech by subjects from 14 countries. We present results for the first three baseline experiments defined for the database as well as explore the multi-lingual aspects of Polycost in a number of ex...
متن کاملSpeaker Recognition Using Frequency Filtered Spectral Energies
The spectral parameters that result from filtering the frequency sequence of log mel-scaled filter-bank energies with a simple first or second order FIR filter have proved to be an efficient speech representation in terms of both speech recognition rate and computational load. Recently, the authors have shown that this frequency filtering can approximately equalize the cepstrum variance enhanci...
متن کاملDatabases for Speaker Recognition: Activities in Cost250 Working Group 2
Working Group (WG) 2 of the COST250 Action “Speaker Recognition in Telephony” has dealt with databases for speaker recognition. The present paper gives an overview of the activities in this WG, and presents its main results. The first result is an overview of 36 existing databases that has been used in speaker recognition research. Those include both public and proprietary databases. As part of...
متن کاملGuidelines for experiments on the POLYCOST database
The purpose of this document is to define a common ground for speaker recognition experiments on the POLYCOST database. It is done by defining a set of baseline experiments for which results always should be included when presenting evaluations made on this database. By including these results and by presenting the differences introduced in new experiments, a comparison between systems tested o...
متن کاملذخیره در منابع من
با ذخیره ی این منبع در منابع من، دسترسی به آن را برای استفاده های بعدی آسان تر کنید
برای دانلود متن کامل این مقاله و بیش از 32 میلیون مقاله دیگر ابتدا ثبت نام کنید
ثبت ناماگر عضو سایت هستید لطفا وارد حساب کاربری خود شوید
ورودعنوان ژورنال:
- Speech Communication
دوره 31 شماره
صفحات -
تاریخ انتشار 2000